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ABSTRACT 

We use measurements and models to develop aerosol models for use in the inversion algorithms for the Cloud 
Aerosol Lidar and Imager Pathfinder Spaceborne Observations (CALIPSO). Radiance measurements and 
inversions of the AErosol RObotic NET work (AERONET 1 2 ) are used to group global atmospheric aerosols using 
optical and microphysical parameters. This study uses more than 10 5 records of radiance measurements, aerosol 
size distributions, and complex refractive indices to generate the optical properties of the aerosol at more 200 sites 
worldwide. These properties together with the radiance measurements are then classified using classical clustering 
methods to group the sites according to the type of aerosol with the greatest frequency of occurrence at each site. 
Six significant clusters are identified: desert dust, biomass burning, urban industrial pollution, rural background, 
marine, and dirty pollution. Three of these are used in the CALIPSO aerosol models to characterize desert dust, 
biomass burning, and polluted continental aerosols. The CALIPSO aerosol model also uses the coarse mode of 
desert dust and the fine mode of biomass burning to build a polluted dust model. For marine aerosol, the 
CALIPSO aerosol model uses measurements from the SEAS experiment \ In addition to categorizing the aerosol 
types, the cluster analysis provides all the column optical and microphysical properties for each cluster. 
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1. INTRODUCTION 

Global aerosol properties are important not only for use in radiative transfer models but also as inputs to the 
inversion algorithms of satellite -based measurements of aerosol optical properties. As we approach an era of 
unprecedented global coverage of aerosol profile measurements (GLAS, CALIPSO) to augment the satellite 
measurements of aerosol optical depths by passive instruments (MODIS, MISR, POLDER, TOMS, SAGE) there 
is a need for well characterized aerosol properties. In its most recent report, the Intergovernmental Panel on 
Climate Change 4 found that radiative forcing by aerosols is the most uncertain of all radiative forcing estimates. 
Reducing these uncertainties calls for expanded aerosol measurements and studies to characterize different types 
of aerosols and sources 5 . Tropospheric aerosols are diverse and their properties depend on sources, emission rates, 
and removal mechanisms and can be highly variable. To understand and quantify aerosol effects, there have been 
several domestic and international campaigns to characterize aerosol physical and chemical properties, and 
process studies. These include the Aerosol Characterization Experiments (ACE -1, -2, -Asia), the Tropospheric 
Aerosol Radiative Forcing Observational Experiment (TARFOX), the Smoke, Clouds, and Radiation-B (SCAR-B) 
experiment, and the Indian Ocean Experiment (INDOEX). ACE-1 took place south of Australia in November- 
December 1995 and measured properties of the natural aerosol in the remote marine boundary layer 6 . ACE-2 took 
place in the North Atlantic Ocean in June- July 1997, and focused on the radiative effects and processes controlling 
anthropogenic aerosols from Europe and desert dust from Africa as they were transported over the North Atlantic 
Ocean 7 . ACE-Asia took place during the spring of 2001 off the coast of China, Japan and Korea. This region 
includes many types of aerosol particles of widely varying composition and sizes derived from one of the largest 
aerosol source regions on Earth. ACE-Asia made several important measurements of wind-blown dust, urban 
pollution and marine aerosols. TARFOX 8 , designed to measure and analyze aerosol properties and effects along 
the US eastern seaboard, took place on July 10-31, 1996. INDOEX 9 measured aerosol properties over the tropical 
Indian Ocean in 1998 and 1999. The large aerosol optical thicknesses (~ 0-5) and the prominent role of the 
carbonaceous aerosols in the extinction budget during most of INDOEX underline the need to develop long-term 
records of these species. 

In many cases, we are likely to use mean properties sorted by location or type to represent aerosol characteristics 
in both radiative transfer calculations 10-13 and inversion algorithms of satellite measurements l4 ' L \ Assigning a set 
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of mean aerosol physical and chemical properties to a given location based on a long-term average has significant 
shortcomings. This is because at any given location, the aerosol type is highly variable on time scales as short as a 
few hours 16 17 . These variations result from transport of distinct airmasses to a site and non-systematic events such 
as fires, wind gusts, hurricanes, tornadoes, and land clearing and development activities. These variations lead to 
diverse aerosol characteristics at each site on time scales of a few hours and preclude the long term averaging of 
properties to develop a representative set of characteristics for a site or region. Aerosol optical measurements must 
therefore be made at short time scales (about 30 minutes) to develop a large data base which can be used to derive 
statistically significant correlations. The AErosol RObotic NETwork (AERONET 1 ) measurements are likely to 
provide a data base with a fine temporal resolution albeit for column rather than vertically-resolved measurements. 
AERONET is an automatic robotic Sun and sky scanning measurement network that has grown rapidly to over 200 
sites worldwide. AERONET uses multi-angle radiance measurements to retrieve the discrete aerosol size 
distributions in 22 size bins ranging from 0.05 to 15 pm and the complex refractive index. The network has the 
important features of uniform data collection, calibration, and data processing procedures. This study uses the 
whole AERONET record of measurements and inversions to develop a type-specific set of mean optical properties 
of aerosols. Cluster analysis is used for categorization of atmospheric aerosol types. Six significant types: desert 
dust, biomass burning, polluted continental, clean continental, marine aerosol, and dirty pollution are suggested by 
the cluster analysis. In this classification, clean continental refers to a lightly-loaded soot-free pollution normally 
found in rural areas and is good approximation for background aerosol. Dirty pollution refers to pollution 
containing significant amounts of absorbing species. 

2. AERONET DATA SET 

An attempt is made to use all the available data set of AERONET measurements dating back to 1993 for some 
sites. These measurements are frequently contaminated by clouds and depending on the cloud reflectivities, can 
have a significant effect on the sun-sky radiance measurements. This study used the AERONET Level 1 size 
distribution data and applies a two-part cloud screening scheme. The first part checks the symmetry of the 
almucantar measurements and the second part is a statistical screening procedure. The almucantar measurement is 
made at several azimuthal angles with the same elevation angle of the direct sun. For the aureole measurement, the 
degree of angle change is set to be quite small near the direction of sun. In order to ensure that the sky is clear at 
the time of the measurement, we calculate the relative error between the seven pairs of data measured on either 
side of the direct sun. The azimuthal angles of these measurements are 2.0°, 2.5°, 3.0°, 3.5°, 4.0°, 5.0°, and 6.0°. 
Using this method ensures that the sky is clear near the measurements because the presence of cloud cover would 
result in disparities between symmetrical pairs of measurements. The statistical screening procedure is based on 
the cloud screening method for the direct solar measurement of AERONET sun/sky radiometer. In this procedure, 
records with optical depths(x) and Angstrom coefficients (a) that exceed a fixed number of standard deviations (a) 
on either side of the mean of the distribution are not included in the analyses. The range of acceptable optical 
depths is T mean -o to T llieiln +3c7. The acceptable Angstrom coefficients are greater than a mean -3o. These conditions 
assume that unrealistically large aerosol extinction values are due to cloud contamination or other transient 
phenomena. A very small Angstrom coefficient is also indicative of cloud contamination. This cloud screening 
scheme on the average rejects about 60% of the data. 

3. CLUSTER ANALYSIS 

Cluster analysis is a statistical tool used for grouping large data sets into several categories using predefined 
variables. In this study, cluster analysis by partitioning 18 is used to categorize the AERONET data set based on 
several optical and physical characteristics of the aerosol. The cluster algorithm uses the 26 parameters in Table 1. 
Note that the optical depths are not used in the cluster analysis because these depend on the amount of aerosol 
rather than type. The first two columns of the table are properties from the AERONET inversion algorithms. In the 
third column are optical properties that can be generated from AERONET inversions of size distribution and 
complex refractive indices using scattering calculations such as Mie modeling 19 , T-Matrix 20 or Discrete Dipole 
Analysis 21 depending on the scattering calculation used in the AERONET inversion algorithm ~ 2 23 . After 
clustering, the individual sites are inspected to find the frequency of records in the categories. Site belongs to a 
category x if more than 30% of the records at that site are grouped in cluster x. It is therefore possible for a site to 
belong to more than one category. This means that the site experiences a high frequency of more than one type of 
aerosol. 
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Table 1. Parameters used in the cluster analysis. The numbers in parentheses denote the number of variables for the given 
property, e.g., there are two geometric mean radii - a fine and coarse radius. 


Composition 

Size distribution 

Optical Properties 

Complex refractive 

Geometric mean 

Single scattering 

index (8) 

radius (2) 

albedo (4) 


Geometric standard 
deviation (2) 

Asymmetry factor (4) 


Mode total volume 

Extinction to 


(2) 

backscatter ratio (4) 


The AERONET algorithm for size distribution retrieval provides the volume distribution data of 22 size bins 
(dV/dlnr, V is volume, and r is radius) from 0.05 pm to 15 pm. The best fit for the AERONET size distribution 
data is a two-mode log-normal size distribution described by equation (1), 
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where the Cs are total mode volume (prnVpm 2 ) and the subscripts f and c denote fine and coarse modes, 

respectively, o is the geometric standard deviation and r is the geometric mean radius. This partition of the size 
distribution into fine and coarse modes yields six parameters by which the size distribution can be described. 
Considering both anthropogenic and natural sources, atmospheric aerosols occur in at least five distinct classes: 
marine aerosol, desert-dust, biomass burning aerosol, urban aerosol and rural -background aerosol. This forms the 
minimum number of aerosol clusters. The actual number of clusters is determined by calculating the clusters using 
successively larger cluster numbers until the calculation does not yield any new significant clusters. Having 
defined the number of categories, a random function is used to determine the initial conditions of each variable. 
These initial coordinates of the variables are located within one standard deviation of the mean assuming a normal 
distribution of the variables. The data clusters formed in this way group all records that have statistically 
significant similarities in one category. At each iteration, each record is assigned to the cluster whose center is 
"closest" (using a Euclidean distance metric) to the record. When all records have been assigned to individual 
clusters, new cluster centers are determined by averaging the variables in each cluster. The process is repeated 
with the new cluster centers until the relative error between the centers of new and old clusters is less than a 
prescribed value. The distance (normalized by the standard deviation to eliminate bias resulting from the different 
magnitudes of the variables) between a record and the center of category j is calculated using equation 2, 


normalized distance(j) = 



data record(i)-center(j,i) 
{standard deviation(i)}" 


( 2 ) 


The six distances are compared and each record is grouped in the category with the shortest distance to the cluster 
center. After grouping the records into the six groups, a new center for each cluster is determined by averaging the 
variables in each cluster. The process is repeated with the new cluster centers until the relative error between the 
centers of new and old clusters is less than 0.1%. Convergence is achieved within 25 iterations for the various 
initial conditions. Figure 1 shows a flowchart of the clustering algorithm. 
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Figure 1. Flowchart of the clustering algorithm used to group 143,913 AERONET records into clusters of with similar 
composition, size distribution, and optical properties. The convergence is achieved when the cluster centers in 26-parameter 
space do not change by more than 0.1% in successive iterations. 


4. THE CATEGORIES OF AEROSOL 

The AERONET data set yielded six distinct clusters described by the column optical and physical properties. 
After the algorithm has generated the number of aerosol categories, each category is assigned an aerosol type. 
Aerosol typing of the categories relies on telltale properties such as the fine and coarse fractions, particle size, 
optical depth, geographic location and in some cases seasonal variation. The effects of non-sphericity on the 
cluster analysis are unknown but not negligible. Table 2 shows the properties of the six clusters. The values shown 
in the table are some of coordinates in 26-parameter space of the centers (medoids) of the clusters. The properties 
of the medoids along with the geographic locations of the sites suggest that categories 1 to 6 are desert dust, 
biomass burning, rural (background), industrial pollution, marine, and dirty pollution, respectively. 

The size distributions and the location of sites belonging to category 1 are Banizoumbou, Ouagadougou, Cape 
Verde, llorin, Solar Village, Bahrain, Kanpur, Dahkla, Sede Boker, Nes Ziona, Maricopa, Ascension Island, La 
Paguera, and Yulin shown in Fig. 2. All these sites are in desert regions or close to deserts or in regions where 
desert dust as a result of long-range transport has been observed 24 ' 27 . The mean radius and geometric standard 
deviation for the fine (coarse) mode is 0.12 pm and 1.5 (2.8 pm and 1.91), respectively. The ftne/coarse aerosol 
ratio by volume is 0.22, i.e., coarse sizes are dominant as shown in Fig 3a. The medoid properties of these aerosols 
are consistent with other measurements dust aerosols. The global aerosol model of d’Almeida et al. 28 estimate dry 
accumulation and coarse mode radii for dust of 0.27 pm and 4.00 pm, respectively. Tanre et al. used spectral 
measurements from both Landsat TM and AERONET to find size distributions for dust aerosols of 0.5 pm for the 
fine mode and 1- 5 pm for the coarse mode. Real refractive indices from the same study ranged from 1.46 - 1.52. 
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Table 2. Summary of the cluster analysis results for 143913 records. The values are the center of each cluster (medoid) and the 
membership coefficient is an indication of the ‘tightness’ of each cluster. 



Cat 1 

Cat 2 

Cat 3 

Cat 4 

Cat 5 

Cat 6 

Number of Records 

22202 

26662 

20307 

55667 

6527 

12548 

Single Scat. Albedo (673 nm) 

0.93 

0.80 

0.88 

0.92 

0.93 

0.72 

Real Refractive Index (673 nm) 

1.4520 

1.5202 

1.4494 

1.4098 

1.3943 

1.4104 

Imaginary Refractive Index (673 nm) 

0.0036 

0.0245 

0.0092 

0.0063 

0.0044 

0.0337 

Optical Depth (673 nm) 

0.327 

0.190 

0.036 

0.191 

0.140 

0.100 

Angstrom Coefficient (441/673) 

0.608 

1.391 

1.534 

1.597 

0.755 

1.402 

Angstrom Coefficient (673/873) 

0.486 

1.332 

1.381 

1.536 

0.678 

1.232 

Angstrom Coefficient (873/1022) 

0.277 

1.043 

0.950 

1.290 

0.531 

0.846 

Asymmetry factor (673 nm) 

0.668 

0.603 

0.580 

0.612 

0.711 

0.594 

Fine mean radius (|xm) 

0.117 

0.144 

0.133 

0.158 

0.165 

0.140 

Geo. Standard Dev. (fine) 

1.482 

1.562 

1.502 

1.526 

1.611 

1.540 

Fine total volume (|im 3 /(a.m 2 ) 

0.077 

0.040 

0.013 

0.061 

0.029 

0.032 

Coarse mean radius (pm) 

2.834 

3.733 

3.590 

3.547 

3.268 

3.556 

Geo. Standard Dev. (coarse) 

1.908 

2.144 

2.104 

2.065 

1.995 

2.134 

Coarse total volume (pm 3 /pm 2 ) 

0.268 

0.081 

0.020 

0.054 

0.083 

0.034 

Fine fraction by volume 

0.22 

0.33 

0.38 

0.53 

0.26 

0.49 

Membership Coefficient 

0.92 

0.93 

0.94 

0.93 

0.91 

0.89 



Figure 2. The locations of the six categories. The locations and properties of the categories 1, 2, 4, 5 and 6 are 
consistent with desert dust, biomass burning, continental pollution, marine and dirty pollution, respectively. 
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Figure 3. Size distributions of the cluster medoids of the six aerosol categories 


Using airborne particle counters, a ground based lidar and sunphotometer, Di lorio et al. 26 estimated complex 
refractive indices of (1.52 to 1.58) - (0.005 to 0.007)i for dust aerosols. In the same study they estimate single 
scattering albedos ranging from 0.71-0.75. Category 1 aerosols have a mean single scattering albedo of 0.93 which 
is closer to the values reported by others 24 

Category 2 sites include Skukuza, IMS Metu Erdeml, Egbert, Abracos Hill, Mongu, Cuiaba, Los Fieros, Yulin, 
CEILAP-BA. The size distributions are described by fine (coarse) radius and standard deviation of 0.15 pm, and 
1.62 (3.8 pm, and 2.1), respectively. The fme-to-coarse aerosol ratio based on the volume distribution is 1.0. Real 
and imaginary parts (m, and m;) of the refractive idex of 1.46 and 0.03 are consistent with measurements of 
Dubovik et al. 13 and Ansmann et al. 15 . Using spectrally resolved particle backscatter and extinction 
measurements, the effective particle radius of free tropospheric aerosol advected from the Indian subcontinent and 
primarily composed of biomass burning pollution is estimated at 0.17 pm 29 . Muller et al. 30 found effective radii 
ranging between 0.14 and 0.22 pm, complex refractive indices of 1.65 - (0.03 to 0.08)i and single scattering 
albedos of 0.79 to 0.86 at 532 nm for the same aerosol plumes. A series of measurements of both microphysical 
and optical properties of biomass burning aerosols made during the SCAR-B campaign 31 found values of size 
distributions, single scattering albedos, complex refractive indices, and optical depths not inconsistent with the 
values reported for category 2 in Table 2 above. 

The Category 3 aerosol is characterized by a low optical depth (0.04) and a relative large frequency of incidence 
in areas where the atmosphere is expected to be relatively clean, i.e., clean continental or rural background. The 
members of this category include Mauna Loa, Rairoad Valley, Saturn Island, Sevilleta, Rogers Dry Lake, Bonanza 
Creek, Rimrock, Maricopa, H. J. Andrews, Howland, Konza EDC, Bethlehem, San Nicolas shown in Figure 2. 
The fme/coarse mode ratio by volume is 0.4 and a fine (coarse) radius and standard deviation of 0. 13 pm, and 1 .50 
(3.6 pm, and 2.1), respectively. Most of these sites are continental with the exception of Mauna Loa. Though this 
category has a large number of records (20307), the low optical depth of the measurements means necessarily that 
the retrieval accuracy is significantly deteriorated. For all AERONET measurements the accuracy of the single 
scattering albedo, and refractive indices significantly decreases for 441 nm optical depths of less than 0.2 25, 32 
Category 4 aerosols are most prevalent at Walker Branch, Venice, ISDGM CNR, Moscow MSU MO, MD Science 
Center, Columbia SC, GSFC, Wallops, SERC, Avignon, UCLA, Cart Site, Mexico City, Toulouse, Anmyon, Lille, 
Moldova, Stennis, Rome Tor Vergat, Konza EDC, Rio Branco, Nes Ziona, Ispra, Shirahama, La Jolla, Bondville, 
COVE, Dry Tortugas, Alta Floresta, Arica, Fresno, GISS, Sao Paulo, Gotland, Bethlehem, Abracos Hill, Bahrain, 
Sioux Falls, Los Fieros, Corcoran, Sede Boker, Kaashidhoo, El Arenosillo, Bermuda, Bordeaux, Mongu, 
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Waskesiu. The fine fraction by volume is 0.53 and the size distribution is described by a fine (coarse) radius and 
standard deviation of 0.16 |im, and 1.53 (3.5 |im, and 2.1), respectively. The location of the aerosols (near or in 
urban centers), and the composition (mean refractive index of 1.41 - 0.006i) are consistent with urban pollution. 
Category 5 sites are Lanai, Tahiti, Arica, La Paguera, Dry Tortugas, Ascension Islan, La Jolla. The fine-to-coarse 
volume ratio is 0.26 and the refractive index is 1.39 - 0.004i at 673 nm. The global aerosol models of d’Ameida et 
al . 28 estimate an oceanic aerosol refractive index of 1.38 with a negligibly small imaginary part. All the sites are 
islands or coastal sites. The size distribution is described by a fine (coarse) radius and geometric standard 
deviation of 0.17 pm, and 1.6 (3.3 pm, and 2.0), respectively. The coarse mode is most likely seasalt. The fine 
mode, whose contribution is most likely responsible for the enhancement of the imaginary part of the total 
refractive index, is most likely smoke or biomass burning aerosol. It is therefore likely that Category 5 is marine 
aerosol mixed with biomass burning smoke or industrial pollution. 

Category 6 sites are Egbert, Skukuza, Bordeaux, Etosha Pan, Belterra, IMS Metu Erdeml, Waskesiu, 
Dalanzadgad. Three of these sites (Skukuza, IMS Metu Erdeml, Dalanzadgad) are also found in category 2. This 
class of aerosols (referred to as dirty pollution) is characterized by a size distribution similar to category 4 
(industrial pollution) with a larger imaginary index of refraction (0.034) and small single scattering albedos. These 
are most likely aerosols with a large fraction of elemental carbon. 

Only categories 1, 2, and 4 are used for the CALIPSO models. Categories 3 has low mean optical depths and 
therefore the retrievals are highly uncertain, and category 5 has a small number of records. 

5. SUMMARY OF THE AERONET CLUSTER STUDIES 

The AERONET classification yields five groups of aerosols shown in Table 3. The table also shows the main 
constituent species of each type. Desert dust is mostly mineral soil. Biomass burning is an aged smoke aerosol 
consisting primarily of soot and organic carbon (OC). Background aerosol is a lightly loaded aerosol consisting of 
sulfates (S0 4 2 ), nitrates (NO 3 ), OC, and Ammonium (NH 4 + ). Marine aerosol consists primarily of seasalt (NaCl). 
Both polluted continental and dirty pollution consist of the same species but the large imaginary part of the 
refractive index of dirty pollution suggests that this type of aerosol contains a significantly larger (than polluted 
continental) fraction of soot. 


Table 3. Aerosol types determined by the cluster analysis of AERONET data 


Cluster Category 

Aerosol Type 

1 

Desert dust (Mineral dust) 

2 

Biomass burning (soot + OC) 

3 

Background/rural (S0 4 2 ', N0 3 ", OC, NH 4 + ) 

4 

Polluted continental (S0 4 2- , N0 3 ", OC, NH 4 + + soot) 

5 

Marine(NaCI) 

6 

Dirty pollution (S0 4 2- , NO 3 ', OC, NH 4 + + soot) 


6. CONSISTENCY OF AEROSOL MICROPHYSICAL PROPERTIES 

To check for the consistency of the individual categories, we divided the measurements in each category into five 
optical depth classes and plotted the size distributions of each class within a category. The results are shown in 
Figure 4. Note that for each category, the magnitude of the fine and coarse mode amplitudes of the size 
distribution denote fine and coarse loading, respectively. As shown, the mode amplitudes increase with the optical 
depth for all types. The optical depth is an extensive property as is the mode amplitude and this behavior is 
expected for a given aerosol type. The fine and coarse mean radii and geometric standard deviations, on the other 
hand, are intensive properties, i.e., properties that depend on the type of aerosol and not the the amount or loading. 
These are relatively constant across optical depth classes within categories. This means that despite changes in the 
optical depth, the aerosol type is consistently the same in each category. While this is not a sufficient validation of 
the clustering method, it is an indication of the ability of the algorithm to group similar data sets using the 
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prescribed variables. Since some of these records span a period of nearly a decade, the most conclusive validation 
of the clustering algorithm would be to examine a climatology of in situ measurements at the individual sites. 
While such in situ data may become available for some sites in future, there is currently no climatology of aerosol 
types at any of the AERONET sites. Even if such a climatology were available, it would only be useful in 
locations were a given type of aerosol (say polluted continental) was dominant almost to the exclusion of all other 
types. For sites that experience several different types of aerosols, climatologies based on time averages longer 
than a few hours would be misleading. 
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Figure 4. Size distributions of dust (a), biomass burning (b), background/rural (c), polluted continental (d), marine (e), and 
dirty pollution (f) aerosol, i.e., categories 1, 2, 3, 4, 5, and 6 respectively. The data in each category was partitioned into five 
aerosol optical thickness (AOT) classes. 





7. CALIPSO AEROSOL MODELS 

Fig. 5 (a)-(f) show the physical and chemical properties of the CALIPSO aerosol models and the corresponding 
extinction/backscatter values at 532 nm and 1064 nm. The AERONET cluster analysis yielded six distinct types of 
aerosol. Only three of these clusters (desert dust, biomass burning, polluted continental, Fig. 6a, b, and d, 
respectively) were used to characterize the CALIPSO aerosol models. 



Figure 5. The size distributions and microphysical properties of the CALIPSO aerosol models. For each model the 
extinction/backscatter ratio S a (in sr) at 532 nm (top) and 1064 nm (bottom), is shown in the left hand quadrant. r g v 
and 0 g v are the geometric mean radii and standard deviations of the distribution respectively, m, is the complex 
refractive index at wavelength i in nm. Cf and C v are the fine and coarse volume fractions of the distribution. 
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The CALIPSO model of background and marine aerosols are not derived from the AERONET measurements. The 
AERONET records of the background cluster had low mean optical depths (< 0.05 at 673 nm). The microphysical 
properties derived from these are likely to have large uncertainties 25 . The CALIPSO background aerosol model 
(Fig. 5c) was derived by fitting size distributions and refractive indices to measurements of extinction/backscatter 
ratios of long-range continental transport 33 . The AERONET marine aerosol cluster is comprised of a small 
number of records (< 4% of the total). The CALIPSO marine aerosol model (Fig. 5f) is derived from the 
parameters measured during the SEAS experiment 3 . The CALIPSO polluted dust is a mixture of the AERONET 
desert dust (coarse mode) and biomass burning (fine mode) clusters (Fig 5e). 


8. CONCLUSION 

We have used measurements and models to develop aerosol models for use in the CALIPSO lidar inversion 
algorithms. The CALIPSO aerosol model consist of six aerosol types: Desert dust, biomass burning, background, 
polluted continental, polluted dust, and marine. A global data set, AERONET, has been used to identify main 
clusters of aerosol types and to determine microphysical properties of aerosol groups. This characterization is 
augmented by measurements where the uncertainty in the AERONET retrievals is expected to be high. The effects 
of non-sphericity on the cluster analysis are unknown but not negligible. These need to be addressed in future 
studies. The CALIPSO models will evolve with new measurements and, upon launch the CALIPSO data will 
significantly enhance the available data base of the characteristics of aerosol types. Such data sets will be used to 
further refine the probability distribution functions and threshold values for use in the algorithms. 
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